FarsTail: a Persian natural language inference dataset

Abstract

Natural language inference (NLI) is known as one of the central tasks in natural language processing (NLP), which encapsulates many fundamental aspects of language understanding. With the considerable achievements of data-hungry deep learning methods in NLP tasks, a great amount of effort has been devoted to developing more diverse datasets for different languages. In this paper, we present a new dataset for the NLI task in the Persian language, also known as Farsi, which is one of the dominant languages in the Middle East. This dataset, named FarsTail, includes 10,367 samples, which are provided both in the Persian language and in an indexed format to be useful for non-Persian researchers. The samples are generated from 3,539 multiple-choice questions with the least amount of annotator intervention, in a way similar to the SciTail dataset. A carefully designed multi-step process is adopted to ensure the quality of the dataset. We also present the results of traditional and state-of-the-art methods on FarsTail, including different embedding methods such as word2vec, fastText, ELMo, BERT, and LASER, as well as different modeling approaches such as DecompAtt, ESIM, HBMP, and ULMFiT, to provide a solid baseline for future research. The best obtained test accuracy is 83.38%, which shows that there is big room for improving the current methods to be useful for real-world NLP applications in different languages. We also investigate the extent to which the models exploit superficial clues, also known as dataset biases, in FarsTail, and partition the test set into easy and hard subsets according to the success of biased models. The dataset is available at https://github.com/dml-qom/FarsTail
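
The easy/hard partition mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes that an example counts as "easy" when a biased model (for instance, a classifier that sees only the hypothesis) predicts its label correctly, and as "hard" otherwise. The function names and the label strings used with them are hypothetical and are not taken from the FarsTail repository.

```python
from typing import List, Sequence, Tuple


def partition_easy_hard(
    gold: Sequence[str], biased_pred: Sequence[str]
) -> Tuple[List[int], List[int]]:
    """Split example indices by whether a biased model labeled them correctly.

    Assumption: "easy" = biased model is right, "hard" = biased model is wrong.
    """
    if len(gold) != len(biased_pred):
        raise ValueError("gold and biased_pred must have the same length")
    easy: List[int] = []
    hard: List[int] = []
    for i, (g, p) in enumerate(zip(gold, biased_pred)):
        (easy if g == p else hard).append(i)
    return easy, hard


def subset_accuracy(
    gold: Sequence[str], pred: Sequence[str], indices: Sequence[int]
) -> float:
    """Accuracy of `pred` against `gold`, restricted to the given indices."""
    if not indices:
        raise ValueError("indices must be non-empty")
    correct = sum(1 for i in indices if gold[i] == pred[i])
    return correct / len(indices)
```

Reporting a full model's accuracy separately on the two subsets gives a rough indication of how much of its performance may depend on superficial clues: a large gap between easy and hard accuracy suggests reliance on dataset biases.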

Similar resources

Constructing a Natural Language Inference dataset using generative neural networks

Natural Language Inference is an important task for Natural Language Understanding. It is concerned with classifying the logical relation between two sentences. In this paper, we propose several text generative neural networks for generating text hypothesis, which allows construction of new Natural Language Inference datasets. To evaluate the models, we propose a new metric – the accuracy of th...

Natural Language Understanding for Grading Essay Questions in Persian Language

Many intelligent systems are intended to communicate with users through natural language. Understanding the natural language by the computer is one of the most essential operations in natural language processing. One of the applications of natural languages is in the exams having essay questions. The objective of this paper is to propose a method for designing an examiner machine and creating a...

Natural logic and natural language inference

We propose a model of natural language inference which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We extend past work in natural logic, which has focused on semantic containment and monotonicity, by incorporating both semantic exclusion and implicativity. Our model decomposes an inference problem into a sequence of atomic edits lin...

Natural Language Inference in Coq

In this paper we propose a way to deal with natural language inference (NLI) by implementing Modern Type Theoretical Semantics in the proof assistant Coq. The paper is a first attempt to deal with NLI and natural language reasoning in general by using the proof assistant technology. Valid NLIs are treated as theorems and as such the adequacy of our account is tested by trying to prove them. We use...

Generating Natural Language Inference Chains

The ability to reason with natural language is a fundamental prerequisite for many NLP tasks such as information extraction, machine translation and question answering. To quantify this ability, systems are commonly tested whether they can recognize textual entailment, i.e., whether one sentence can be inferred from another one. However, in most NLP applications only single source sentences ins...

Journal

Journal title: Soft Computing

Year: 2023

ISSN: 1433-7479, 1432-7643

DOI: https://doi.org/10.1007/s00500-023-08959-3